Optimal Randomized Approximations for Matrix-Based Rényi’s Entropy
نویسندگان
چکیده
The Matrix-based Rényi’s entropy enables us to directly measure information quantities from given data without the costly probability density estimation of underlying distributions, thus has been widely adopted in numerous statistical learning and inference tasks. However, exactly calculating this new quantity requires access eigenspectrum a semi-positive definite (SPD) matrix $A$ which grows linearly with number samples notation="LaTeX">$n$ , resulting notation="LaTeX">$O(n^{3})$ time complexity that is prohibitive for large-scale applications. To address issue, paper takes advantage stochastic trace approximations matrix-based arbitrary notation="LaTeX">$\alpha \in \mathbb {R}^{+}$ orders, lowering by converting approximation matrix-vector multiplication problem. Specifically, we develop random integer-order $ cases polynomial series (Taylor Chebyshev) fractional cases, leading notation="LaTeX">$O(n^{2}sm)$ overall complexity, where notation="LaTeX">$s, m \ll n$ denote vector queries order respectively. We theoretically establish guarantees all algorithms give explicit notation="LaTeX">$s$ notation="LaTeX">$m$ respect error notation="LaTeX">$\epsilon showing optimal convergence rate both parameters up logarithmic factor. Large-scale simulations real-world applications validate effectiveness developed approximations, demonstrating remarkable speedup negligible loss accuracy.
منابع مشابه
Entropy-based nonlinear viscosity for Fourier approximations of conservation laws
An Entropy-based nonlinear viscosity for approximating conservation laws using Fourier expansions is proposed. The viscosity is proportional to the entropy residual of the equation (or system) and thus preserves the spectral accuracy of the method. To cite this article: J.-L. Guermond, R. Pasquetti, C. R. Acad. Sci. Paris, Ser. I 346 (2008). © 2008 Académie des sciences. Published by Elsevier M...
متن کاملPartitions for Optimal Approximations
The Riemann integral can be approximated using partitions and a rule for assigning weighted sums of the function at points determined by the partition. Approximation methods commonly used include endpoint rules, the midpoint rule, the trapezoid rule, Simpson’s rule, and other quadrature methods. The rate of approximation depends to a large degree on the rule being used and the smoothness of the...
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملExponential Models: Approximations for Probabilities
Welch & Peers (1963) used a root-information prior to obtain posterior probabilities for a scalar parameter exponential model and showed that these Bayes probabilities had the confidence property to second order asymptotically. An important undercurrent of this indicates that the constant information reparameterization provides location model structure, for which the confidence property ...
متن کاملEIGENVECTORS OF COVARIANCE MATRIX FOR OPTIMAL DESIGN OF STEEL FRAMES
In this paper, the discrete method of eigenvectors of covariance matrix has been used to weight minimization of steel frame structures. Eigenvectors of Covariance Matrix (ECM) algorithm is a robust and iterative method for solving optimization problems and is inspired by the CMA-ES method. Both of these methods use covariance matrix in the optimization process, but the covariance matrix calcula...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 2023
ISSN: ['0018-9448', '1557-9654']
DOI: https://doi.org/10.1109/tit.2023.3260122